DE eng

Search in the Catalogues and Directories

Page: 1 2
Hits 1 – 20 of 23

1
Vocapia-LIMSI System for 2020 Shared Task on Code-switched Spoken Language Identification
In: The First Workshop on Speech Technologies for Code-Switching in Multilingual Communities ; https://hal.archives-ouvertes.fr/hal-03091792 ; The First Workshop on Speech Technologies for Code-Switching in Multilingual Communities, Oct 2020, Shanghai, China (2020)
BASE
Show details
2
Challenges in Audio Processing of Terrorist-Related Data
In: International Conference on Multimedia Modeling ; https://hal.archives-ouvertes.fr/hal-02415176 ; International Conference on Multimedia Modeling, Springer, Jan 2019, Thessaloniki, Greece (2019)
BASE
Show details
3
Challenges in Audio Processing of Terrorist-Related Data
In: International Conference on Multimedia Modeling ; https://hal.archives-ouvertes.fr/hal-02387373 ; International Conference on Multimedia Modeling, Springer, Jan 2019, Thessaloniki, Greece (2019)
BASE
Show details
4
Language Recognition for Dialects and Closely Related Languages
In: Odyssey 2016 ; https://hal.archives-ouvertes.fr/hal-01744188 ; Odyssey 2016, Jun 2016, Bilbao, Spain (2016)
BASE
Show details
5
Improving Data Selection for Low Resource STT and KWS
BASE
Show details
6
Lexical speaker identification in TV shows
In: ISSN: 1380-7501 ; EISSN: 1573-7721 ; Multimedia Tools and Applications ; https://hal.archives-ouvertes.fr/hal-01690342 ; Multimedia Tools and Applications, Springer Verlag, 2015, 74 (4), pp.1377 - 1396. ⟨10.1007/s11042-014-1940-3⟩ (2015)
BASE
Show details
7
Traduction de la parole dans le projet RAPMAT
In: Journées d'Études sur la Parole ; https://hal.archives-ouvertes.fr/hal-01843418 ; Journées d'Études sur la Parole, Jan 2014, Le Mans, France (2014)
BASE
Show details
8
Comparing decoding strategies for subword-based keyword spotting in low-resourced languages
In: Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-01843408 ; Annual Conference of the International Speech Communication Association , ISCA, Sep 2014, Singapore, Singapore (2014)
BASE
Show details
9
Combination of Cepstral and Phonetically Discriminative Features for Speaker Verification
In: ISSN: 1070-9908 ; IEEE Signal Processing Letters ; https://hal.archives-ouvertes.fr/hal-01690336 ; IEEE Signal Processing Letters, Institute of Electrical and Electronics Engineers, 2014, 21 (9), pp.1040 - 1044. ⟨10.1109/LSP.2014.2323432⟩ (2014)
Abstract: International audience ; Most speaker recognition systems rely on short-term acoustic cepstral features for extracting the speaker-relevant information from the signal. But phonetic discriminant features, extracted by a bottleneck multi-layer perceptron (MLP) on longer stretches of time, can provide a complementary information and have been adopted in speech transcription systems. We compare the speaker verification performance using cepstral features, discriminant features, and a concatenation of both followed by a dimension reduction. We consider two speaker recognition systems, one based on maximum likelihood linear regression (MLLR) super-vectors and the other on a state-of-the-art i-vector system with two session variability compensation schemes. Experiments are reported on a standard configuration of NIST SRE 2008 and 2010 databases. The results show that the phonetically discriminative MLP features retain speaker-specific information which is complementary to the short-term cepstral features. The performance improvement is obtained with both score domain and feature domain fusion and the speaker verification equal error rate (EER) is reduced up to 50% relative, compared to the best i-vector system using only cepstral features.
Keyword: [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing; [INFO]Computer Science [cs]; bottleneck features; i-vector; LDA; multi-layer perceptron; PCA; PLDA; Speaker verification
URL: https://doi.org/10.1109/LSP.2014.2323432
https://hal.archives-ouvertes.fr/hal-01690336/document
https://hal.archives-ouvertes.fr/hal-01690336/file/double-final.pdf
https://hal.archives-ouvertes.fr/hal-01690336
BASE
Hide details
10
Lattice MLLR based m-vector system for speaker verification
In: IEEE International Conference on Acoustics, Speech, and Signal Processing ; https://hal.archives-ouvertes.fr/hal-01836461 ; IEEE International Conference on Acoustics, Speech, and Signal Processing, Jan 2013, Vancouver, Canada (2013)
BASE
Show details
11
Unsupervised Speaker Identification using Overlaid Texts in TV Broadcast
In: Proceedings of the 13th Annual Conference of the International Speech Communication Association (Interspeech) ; Interspeech 2012 - Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-00767427 ; Interspeech 2012 - Conference of the International Speech Communication Association, Sep 2012, Portland, OR, United States. 4p (2012)
BASE
Show details
12
Recherche par le contenu dans des documents audiovisuels multilingues
In: ISSN: 1279-5127 ; EISSN: 1963-1014 ; Document Numérique ; https://hal.inria.fr/hal-00953796 ; Document Numérique, Lavoisier, 2010, 13 (1), pp.229-246 (2010)
BASE
Show details
13
Content-based search in multilingual audiovisual documents using the International Phonetic Alphabet
In: ISSN: 1380-7501 ; EISSN: 1573-7721 ; Multimedia Tools and Applications ; https://hal.archives-ouvertes.fr/hal-00953696 ; Multimedia Tools and Applications, Springer Verlag, 2010, 48 (1), pp.123-140. ⟨10.1007/s11042-009-0377-6⟩ (2010)
BASE
Show details
14
Automatic speech recognition for under-resourced languages: application to Vietnamese language
In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 17 (2009) 8, 1471-1482
BLLDB
OLC Linguistik
Show details
15
Exploitation d'un corpus bilingue comparable pour la création d'un système de traduction probabiliste Vietnamien - Français
In: TALN ; TALN 2009, Senlis, 24-26 juin 2009 ; https://hal.archives-ouvertes.fr/hal-00959202 ; TALN 2009, Senlis, 24-26 juin 2009, 2009, Unknown, pp.x-x (2009)
BASE
Show details
16
Mining a comparable text corpus for a Vietnamese - French statistical machine translation system
In: Fourth Workshop on Statistical Machine Translation ; https://hal.archives-ouvertes.fr/hal-01393602 ; Fourth Workshop on Statistical Machine Translation, 2009, Athens, Greece. pp.165 - 172, ⟨10.3115/1626431.1626466⟩ ; http://www.statmt.org/wmt09/ (2009)
BASE
Show details
17
Recherche par le contenu dans des documents audiovisuels multilingues
In: Actes de la conférence CORIA ; https://hal.inria.fr/hal-00954025 ; Actes de la conférence CORIA, 2009, Giens, France. pp.67-82 (2009)
BASE
Show details
18
Content-Based Search in Multilingual Audiovisual Documents using the International Phonetic Alphabet
In: 7th International Workshop on Content-Based Multimedia Indexing (CBMI 2009) ; https://hal.inria.fr/hal-00953855 ; 7th International Workshop on Content-Based Multimedia Indexing (CBMI 2009), 2009, Chania, Crete. 3-5 June 2009 (2009)
BASE
Show details
19
Normalisation et alignement de corpus français et vietnamiens : Format et Logiciels
In: Actes JATD 2008 ; journées internationales d'analyse statistique des données textuelles ; https://hal.archives-ouvertes.fr/hal-01705630 ; journées internationales d'analyse statistique des données textuelles, Jun 2008, Lyon, France (2008)
BASE
Show details
20
Acoustic-Phonetic Unit Similarities for Context Dependent Acoustic Model Portability
BASE
Show details

Page: 1 2

Catalogues
0
0
1
0
0
0
0
Bibliographies
1
0
0
0
0
0
0
0
0
Linked Open Data catalogues
0
Online resources
0
0
0
0
Open access documents
22
0
0
0
0
© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern